Generative adversarial simulator

نویسندگان

چکیده

Knowledge distillation between machine learning models has opened many new avenues for parameter count reduction, performance improvements, or amortizing training time when changing architectures the teacher and student network. In case of reinforcement learning, this technique also been applied to distill policies students. Until now, policy required access a simulator real world trajectories. In paper we introduce simulator-free approach knowledge in context learning. A key challenge is having learn multiplicity cases that correspond given action. While prior work shown data-free possible with supervised by generating synthetic examples, these approaches are vulnerable only producing single prototype example each class. We propose an extension explicitly handle multiple observations per output class seeks find as exemplars reinitializing our data generator making use adversarial loss. To best knowledge, first demonstration policy. This improves over state art on networks benchmark datasets (MNIST, Fashion-MNIST, CIFAR-10), demonstrate it specifically tackles issues input modes. identify open problems distilling agents trained high dimensional environments such Pong, Breakout, Seaquest.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Colorization of Grayscale Images Using Generative Adversarial Networks

Automatic colorization of gray scale images poses a unique challenge in Information Retrieval. The goal of this field is to colorize images which have lost some color channels (such as the RGB channels or the AB channels in the LAB color space) while only having the brightness channel available, which is usually the case in a vast array of old photos and portraits. Having the ability to coloriz...

متن کامل

Evolutionary Generative Adversarial Networks

Generative adversarial networks (GAN) have been effective for learning generative models for real-world data. However, existing GANs (GAN and its variants) tend to suffer from training problems such as instability and mode collapse. In this paper, we propose a novel GAN framework called evolutionary generative adversarial networks (E-GAN) for stable GAN training and improved generative performa...

متن کامل

Generative Adversarial Perturbations

In this paper, we propose novel generative models for creating adversarial examples, slightly perturbed images resembling natural images but maliciously crafted to fool pre-trained models. We present trainable deep neural networks for transforming images to adversarial perturbations. Our proposed models can produce image-agnostic and image-dependent perturbations for targeted and nontargeted at...

متن کامل

Generative Adversarial Nets

We propose a new framework for estimating generative models via an adversarial process, in which we simultaneously train two models: a generative model G that captures the data distribution, and a discriminative model D that estimates the probability that a sample came from the training data rather than G. The training procedure for G is to maximize the probability of D making a mistake. This f...

متن کامل

Unrolled Generative Adversarial Networks

We introduce a method to stabilize Generative Adversarial Networks (GANs) by defining the generator objective with respect to an unrolled optimization of the discriminator. This allows training to be adjusted between using the optimal discriminator in the generator’s objective, which is ideal but infeasible in practice, and using the current value of the discriminator, which is often unstable a...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Artificial Intelligence and Machine Learning

سال: 2021

ISSN: ['2789-2557']

DOI: https://doi.org/10.51483/ijaiml.1.1.2021.31-46